arXiv: 1509.07007v2 [cs.DS] 5 Dec 2016 


Finding Perfect Matchings in Bipartite Hypergraphs 

Chidambaram Annamalai* 

December 6, 2016 


Abstract 

Haxell’s condition [Hax95] is a natural hypergraph analog of Hall’s condition, which is a well- 
known necessary and sufficient condition for a bipartite graph to admit a perfect matching. That 
is, when Haxell’s condition holds it forces the existence of a perfect matching in the bipartite 
hypergraph. Unlike in graphs, however, there is no known polynomial time algorithm to find 
the hypergraph perfect matching that is guaranteed to exist when Haxell’s condition is satisfied. 

We prove the existence of an efficient algorithm to find perfect matchings in bipartite hy¬ 
pergraphs whenever a stronger version of Haxell’s condition holds. Our algorithm can be seen 
as a generalization of the classical Hungarian algorithm for finding perfect matchings in bipar¬ 
tite graphs. The techniques we use to achieve this result could be of use more generally in 
other combinatorial problems on hypergraphs where disjointness structure is crucial, e.g. Set 
Packing. 
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1 Introduction 


Bipartite matchings are a ubiquitous quantity across science and engineering. The task of hnding a 
maximum matching (or, specihcally, a perfect matching) in a bipartite graph captures a fundamental 
notion of assignment that has turned out to have wide applicability. One reason their influence 
has been felt so deeply is essentially computational. The basic fact that maximum matchings in 
bipartite graphs can be found efficiently is arguably the most important factor in determining 
their widespread use. Conhrming the importance of this problem, decades of research in theoretical 
computer science has contributed to increasingly faster and more sophisticated algorithms for hnding 
maximum matchings in bipartite graphs. These include connections to other fundamental problems 
like matrix multiplication [Lov79, MS04]. Most recently, the exciting work of M^dry [Madl3] breaks 
the decades-old 0{my/n) Hopcroft-Karp-Karzanov [HK73, Kar73] barrier for hnding maximum 
matchings in bipartite graphs. 

In this paper we look at the analogous problem in the hypergraph setting. We address the 
following question: Do there exist efficient—in the sense of polynomial running time—algorithms 
to hnd perfect matchings in bipartite hypergraphs? 

In an r-uniform bipartite hypergraph the vertex set is partitioned into two sets A and B such 
that each edge contains exactly one vertex from A and r — 1 vertices from B. A perfect matching 
is a collection of disjoint edges such that each vertex in A is covered by exactly one edge in the 
collection. 

In order for the question to make any sense at all we need to impose some additional restrictions 
on the input as any such algorithm that works unconditionally even for the case r = 3 is tantamount 
to P=NP, as the trivial reduction from 3-DlMENSIONAL MATCHING^ shows. In this sense it is 
not surprising that the question of finding perfect matchings in bipartite hypergraphs was not 
considered before. However, as we will see, under certain conditions the problem becomes interesting 
algorithmically. The starting point for such investigations is to ask ourselves if there is a condition 
similar to Hall’s [Hal35] condition that guarantees the existence of perfect matchings in bipartite 
hypergraphs. This question was solved in a very satisfying way by Haxell [Hax95] in the mid-90s 
leading to a striking generalization of Hall’s theorem. The condition is that for every subset S of A, 
the size of the hitting set of hyperedges incident to S must be proportional to the size of S. This 
condition is sufficient to force the existence of a perfect matching in the hypergraph. More formally, 
given a bipartite hypergraph H = {A, B, E), for a set S' C H let Es := {e € E \ |e n S| = 1} be the 
set of hyperedges of H incident to S. For a given collection of edges F C E, define t{F) to be the 
smallest cardinality subset of B that hits^ all the edges in F. 

Theorem 1.1 (Haxell [Hax95]). Let H = {A,B,E) be an r-uniform bipartite hypergraph. If 

T{Es)>{2r-3){\S\-l) VSCH, 
then H admits a perfect matching. 

There are several interesting aspects to Theorem 1.1. First, it reduces to Hall’s condition when 
r = 2. Second, the statement is “tight” in the sense that it is not true when the strict inequality is 
replaced by a non-strict one, i.e., for every r there is an r-uniform bipartite hypergraph that satishes 
Haxell’s condition with non-strict inequalities and yet contains no perfect matching. Finally, in 

^For a definition of this NP-complete problem see, for example, [Cygl3]. 

^A subset S C B hits all the edges in F if for each e £ F, S Ci e 0. 
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constrast to the graph case, the proof is not constructive and does not lead to an efficient algorithm 
that also finds the perfect matching. 

Besides being interesting objects in their own right, perfect matchings in bipartite hypergraphs 
have become a crucial concept in recent work [BS06, Fei08, HSSll, AFS12, Svel2, PS12, AKS15] 
on a particular allocation problem, called RESTRICTED Max-Min Fair Allocation, where the 
goal is to partition indivisible resources among players in a balanced manner. The latest work that 
exploits this connection [AKS15] has further shown that local search algorithms based on alternating 
trees for this problem can be made to run in polynomial time. These developments brought to light 
the question posed towards the beginning of this section. Is it possible to efficiently find perfect 
matchings in bipartite hypergraphs by assuming a stronger version of Haxell’s condition? 

The question was also raised implicitly by Haeupler, Saha and Srinivasan [HSSll] who write 
that “Haxell’s theorems are again highly non-constructive ....” As a direction for future research it 
was therefore asked in [AKS15] if ensuring the much stronger condition 

t{Es) > 100r|5| VS C A (1) 

was sufficient to also find the perfect matching efficiently. The large constant before r reflected the 
belief that some loss is to be expected, at least by using their techniques. The allocation algorithm 
underlying their work turned out to yield a weaker guarantee under strengthenings similar to (1). 
In particular their techniques yield the following theorem. 

Theorem 1.2 ([AKS15]). Let c > 0 be some absolute constant. Choose any 0<e<l, r>2 
and consider r-uniform bipartite hypergraphs H = {A,B,E) that satisfy t{Es) > (c/e) • r(|S| — 1). 
For such a family of hypergraphs there is a polynomial time algorithm that assigns one hyperedge 
Ga ^ E for every vertex a € A such that it is possible to choose disjoint subsets {S^ C Cq Pi B}a£A 
of cardinality at least (1 — e)(r — 1). 

Whereas in the allocation setting sufficiently large subsets {SajagA also correspond to a good 
allocation, in a hypergraph setting Theorem 1.2 does not guarantee a collection of valid and disjoint 
hyperedges unless e < 1/r. However, for a choice of e in latter range, the assumption t{Es) > 
(c/e) • r{\S\ — 1) is much stronger than the one in Theorem 1.1. 

Our results Our main result is that a suitable constructivization of Theorem 1.1 is indeed pos¬ 
sible. We prove the following. 

Theorem 1.3. For every fixed choice o/e > 0 and r >2, there exists an algorithm A{e, r) that finds, 
in time polynomial in the size of the input, a perfect matching in r-uniform biparite hypergraphs 
H = (A,B,E) satisfying 

t{Es) > i2r - 3 + e){\S\ - 1) VS C A. 

Notice that such an algorithm with a polynomial running time dependence on 1 /e would be able 
to efficiently find perfect matchings in bipartite graphs only assuming Haxell’s condition. Currently 
we see no way of achieving such a result. In particular, our techniques make essential use of the 
e strengthening of Haxell’s condition as assumed in Theorem 1.3. We emphasize that the running 
time of A depends exponentially on r and 1/e, which explains the particular order of the quantifiers 
in the statement. See also Theorem 5.4 for a slightly stronger corollary of our main result. 

An outline of the ideas behind Theorem 1.3 requires setting up some context involving previous 
work, which we do presently. 
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Context It helps to start with the graph case. Here the basic augmenting algorithm (often called 
the Hungarian algorithm after Kdnig and Egervary [WesOl]) takes a partial matching and constructs 
an alternating tree of unmatched and matched edges with an unmatched vertex at the root. For 
convenience we can imagine this tree partitioned into “layers”, where the ith layer contains all vertices 
at distance 2i — 1 and 2i from the root (along with their associated edges in the tree). Matched 
edges appearing in the tree can also be called “blocking” since they prevent us from augmenting the 
partial matching immediately. If a leaf of the alternating tree happens to be an unmatched vertex 
(i.e., the leaf edge is not blocked by some edge in the partial matching) then the corresponding root 
to leaf path is an augmenting path for the considered matching and thus the augmenting algorithm 
terminates. The fact that such a leaf always exists is guaranteed by Hall’s condition [Hal35]. The 
proof of Haxell’s theorem (Theorem 1.1) involves a similar alternating tree, the key difference being 
that a single hyperedge in the alternating tree may now be blocked by several hyperedges (up to 
r — 1) from the partial matching. Therefore, even if a leaf hyperedge in the tree does not intersect 
any hyperedges from the partial matching, we may not be able to immediately augment the partial 
matching like in the graph case. What we can do is only swap the corresponding blocking edge (the 
unique ancestor of the unblocked leaf edge) with the leaf edge in the partial matching and continue. 
When there are no longer leaf edges in the alternating tree, the existence of a vertex disjoint 
hyperedge for one of the A vertices in the tree is implied by Haxell’s condition. This alternating 
tree algorithm for hypergraph matchings by Haxell [Hax95], which underlies Theorem 1.1, is not 
known to make fewer than exponentially many modifications (swapping operations) to the partial 
matching before termination (at which point the root is matched). 

To make such a local search algorithm efficient [AKS15] devised a similar but different algorithm 
that ensures a (constant factor) multiplicative increase in the number of blocking edges from layer to 
layer. This guarantees that the height of the alternating tree is always logarithmic, leading to “short” 
augmenting paths. They also avoid making changes to the partial matching unless sufficiently many 
changes can be made at once. In other words, the partial matching is updated lazily. Coupled with 
other ideas, this leads to a polynomial time combinatorial allocation algorithm that achieves their 
main result. For the hypergraph setting, however, this only yields Theorem 1.2. 

Our Techniques One obstacle with the algorithm of [AKS15], is that a single blocking edge can 
block up to r — 1 hyperedges in the same layer. This effect can accumulate across consecutive layers 
preventing the desired growth in the number of blocking edges across the layers of the alternating 
tree, which we require in order to guarantee a logarithmic bound on the height of the alternating 
tree. This makes it important to view the structure of the blocking edges when the layers of the 
alternating tree are constructed. On the other hand the problem with the regular alternating tree 
algorithm for hypergraph matchings [Hax95] is that a single A vertex in a layer can be part of 
an unbounded number of hyperedges in the next layer in the alternating tree. This skews any 
subsequent progress made by the alternating tree algorithm vastly in favor of a few A vertices in 
the previous layers. To avoid this we impose a degree bound on the A vertices in the alternating 
tree, making the progress more balanced among A vertices in the same layer. In Section 4 we show, 
despite imposing this upper bound, a multiplicative growth in the number of blocking edges from 
layer to layer. Next, since the structure of the blocking edges was considered when constructing a 
layer, this creates complications when we modify the partial matching and some layer in the tree. 
For example, when some blocking edges are removed by swapping operations in a layer it may be 
possible to have additional hyperedges for some of the A vertices in the same layer. At this point 
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our algorithm performs a so-called “superposed-build” operation (see Section 3.2) on the layer to 
check if sufficiently many new hyperedges can be included. If so, it commits the changes, otherwise 
it ignores the newly available hyperedges. 

1.1 Related work 

Most relevant to the result of this paper is the line of work concerning alternating tree algorithms 
for hypergraph matchings starting with the work of Haxell [Hax95]. The algorithmic question of 
whether the underlying local search algorithm can be made efficient was not considered until the 
work of Asadpour, Feige and Saberi [AFS12]. They uncovered a beautiful connection to strong 
integrality gaps for configuration linear programs for allocation problems. This direction was sub¬ 
sequently also pursued by Svensson [Svel2] leading to a breakthrough in the context of scheduling. 
Both results [AFS12, Svel2] were non-constructive and only proved integrality gap upper bounds. 
Following these results it became an important question if such approaches based on alternating 
trees can be turned into efficient algorithms with similar guarantees. Polacek and Svensson [PS12] 
obtained significant savings leading to a quasipolynomial time alternating tree algorithm for RE¬ 
STRICTED Max-Min Fair Allocation, but it is still not clear if their approach can be made 
truly polynomial. Building on these ideas, a polynomial time alternating tree algorithm for the 
same problem was obtained by Annamalai, Kalaitzis and Svensson [AKS15]. 

The success of local search for combinatorial problems on hypergraphs where disjointness struc¬ 
ture is crucial has been a recurring theme in the literature on A:-Set Packing [Kar72]. Hurkens 
and Schrijver [HS89] showed a {k/2 + e) approximation algorithm using an intuitive local search 
algorithm. Halldorsson [Hal95] then obtained a quasipolynomial (A: -|-2)/3-approximation. Using a 
different approach this was improved by Cygan, Grandoni, and Mastrolilli [CGM13] to a quasipoly¬ 
nomial time (A: -|- 1 -|- e)/3-approximation. A polynomial time (k + 2)/3-approximation was obtained 
by Sviridenko and Ward [SW13] using color-coding techniques. The best known result for A:-Set 
Packing to date is a (A: -|- 1 -|- e)/3-approximation due to Gygan [Gygl3], and also by Purer and 
Yu [FY14]. It is interesting to note that all of these results are based on local search. We believe 
our techniques to be a useful addition to this repertoire. 

In an important direction of research Ghan and Lau [GL12] consider the power of linear and 
semidefinite relaxations for the A;-Set Packing problem. They show that a particular LP relax¬ 
ation has integrality gap at most {k 1)/2. A different LP relaxation arrived at by applying 0{k‘^) 
rounds of Ghvatal-Gomory cuts to the standard LP relaxation was also shown to have no worse 
integrality gap by Singh and Talwar [STIO]. It remains interesting to consider the applicability of 
“alternating tree” style analyses, as presented in this paper, to better understand the integrality 
gaps of such strong LP and SDP relaxations. 

For a different notion of bipartiteness in hypergraphs, Gonforti et al. [GGKV96] study sufficient 
conditions for the existence of perfect matchings. We also mention that for the case of general 
hypergraphs, sufficient conditions in the spirit of Dirac’s theorem for graphs [Dir52] are known (see 
Alon et al. [AFH”*’12] and references therein). 

2 Preliminaries 

Definition 2.1 (Bipartite hypergraph). An r-uniform bipartite hypergraph H = {A,B,E) is a 
hypergraph on a vertex set partitioned into two sets A and B such that for every edge e € E, 
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\ef^ A\ = 1 and \ef^B\=r — l. 

Let H = (^4, B, E) be a r-uniform bipartite hypergraph. It is important to note that we assume 
that the underlying bipartition of the vertex set is given to the algorithm. We will use n and m to 
refer to |^| and \E\ respectively in H. A subset of edges M <Z E is called a partial matching if any 
pair of edges in the set are disjoint. A partial matching whose edges contain every vertex of A is a 
perfect matching. 

We need some notation for referring to the collection of A vertices and B vertices in a set of 
edges E <Z E. For a subset of edges E C E we use A{F) to denote the set UeeF eCi A. B(F) is 
dehned similarly as UegF eCi B. 

We say that a vertex a G A is matched by a partial matching M if a G A{M). Recall that a 
perfect matching is a partial matching that matches all the vertices of A. 

For the dehnitions that follow consider a hxed partial matching M in H. From the context it 
will always be clear what the considered partial matching is. 

Definition 2.2 (Blocking edges). The set of edges blocking a given edge e G i? is the set 

{/gM| /nenR/0}, 

i.e., it contains edges in M that prevent us from adding e to it. 

Note that M may contain an edge e' such that H e H R = 0 and it matches d in M, where 
{d} = e n A, in which case we may want to also add e' to the set of blocking edges of e but we do 
not do so according to Dehnition 2.2. 

An edge e £ E is called immediately addable if it has no blocking edges. The name reflects the 
property that M U {e} is also a partial matching for such an edge e, unless the A vertex contained 
in e is already matched by M. We refer to an edge e G i? as an edge for a G A if a G e. 

The dehnitions are made with the following simple operation in mind. 

Swapping operation Suppose that a is matched by M through some edge e G M and that there 
is an immediately addable edge f £ E for a. Then the set M \ {e} U {/} is also a partial matching 
that matches exactly the same set of A vertices as M. 

A hnal piece of notation is the following. For a collection of indexed sets {S'o, (Si,..., Sk} we 
write S<t to denote 

ijs.. 

i=0 

Definition 2.3 (Layer). A layer L for a bipartite hypergraph H = {A,B,E) with respect to a 
partial matching M is a tuple {X, Y) where 

• X CE\M, 

• for each pair of distinct edges e,e' £ X, eCi e' D B = 0, 

• y C M is precisely the set of blocking edges of X, and 

• every e £Y intersects exactly one edge from X. 
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Figure 1: Alternating tree arising from some 3-uniform bipartite hypergraph and partial 
matching, depicted here with a root oq and layers Li and L 2 . The edges in Xi and X 2 
appear with a solid border, whereas (blocking) edges in Yf and I 2 appear with a dotted 
border. The second edge from the left with a solid border in L 2 is an example of an 
immedidately addable edge. 


Definition 2.4 (Alternating tree). An alternating tree T for a bipartite hypergraph H = (A, B, E) 
with respect to a partial matching M is a tuple (Lq, ..., L^) such that: 

• Lq = (Xq) ^) is defined to be (0, {oo}) for some oq not matched by M, 

• Li,... ,Li are layers, 

• A{Yi-i) D A(Xi) for all f = 1,..., £, and 

• B{Xi U Yi) n B{Xi: U Yi>) = 0 Vi 7 ^ i' E {0,... 

Oq is called the root of the alternating tree T. The degree of an A vertex d E A(Y<£) is defined to 
be the number of edges from {X<£ U Y<i) \ Yq that contain d. 

Intuition Our goal will be to obtain an augmenting algorithm that takes some partial matching 
M that does not match some oq E A and turns it into a different partial matching M' that matches 
all the vertices of A[M) U {oo}. To accomplish this consider some edge e € E for oq. If it is 
immediately addable then we are done. Otherwise there are some blocking edges of e, call them F, 
that prevent us from adding e to M. To make progress we will try to perform a swapping operation 
on some of the vertices from A{F) thereby reducing the number of blocking edges of e. To do so 
we need to find edges for A{F) which may themselves turn out to be blocked and so on. This 
alternating structure is captured in our definition of a layer and the tree structure that follows is 
the reason behind Definition 2.4. See Figure 1 for an example of an alternating tree. 
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Degree bound Our augmenting algorithm depends on a single parameter /x ;= e^/(10r^). We 
also define U := . As 17 + 1 will turn out to be an upper bound on the degree of any A vertex 

in the alternating tree maintained by the augmenting algorithm, we refer to U as the degree bound. 
Note that every A vertex in an alternating tree T, except for the root, is part of exactly one blocking 
edge, which follows from Definition 2.4 and the fact that M is a partial matching. Therefore, the 
degree bound implies that each non-root A vertex can be part of at most U other (non-blocking) 
edges in the alternating tree. 

Remark 2.5. Without loss of generality we will assume 0 < e < 1. The parameters /x, 17 of our 
augmenting algorithm are set keeping in mind this range of values that e can assume. If we knew 
stronger guarantees about the hypergraph H, for example, for e as large as lOr, then the values of 
these parameters can be set less aggressively and the running time bounds we obtain in later sections 
can also be improved drastically. Our goal here, however, is to show the existence of polynomial 
time algorithms even for a tiny advantage e. 

3 The Augmenting Algorithm 

3.1 The BuildLayer Subroutine 

We first describe a subroutine BuildLayer that is used by the augmenting algorithm. It takes as 
input an alternating tree T, and a pair of sets X,Y E that serve as the initial values for the layer 
that the subroutine constructs. The subroutine augments X and Y and returns them at the end. 

BuildLayer(r, A, Y) : 

(a) We now describe what we mean by an “addable edge” for some given X,Y <Z E and alternating 
tree T. Suppose T = (Lq, ..., Li). For an A vertex a G A{Yi) we say that a has an addable edge 
if i) d has fewer than U edges in X, and ii) 3 edge e £ E for d disjoint from B(X<iUY<iUXUY). 

(b) While there is an a G AiYi) having an addable edge e £ E, add e to A and its blocking edges 
to Y as follows: 

A ^ AU{e}, 

Y £- TU{/GM|/nenB/0}. 

End While. 

(c) Return (A, Y) 

3.2 Main Algorithm 

We now describe the augmenting algorithm. The input to the algorithm is a partial matching M 
along with an A vertex oq that is not matched by M. 

Initialization Initialize layer Lq in an alternating tree T by setting (Ao,lo) •<— (0,{ao})- The 
variable £ will be updated to always point to the last layer in the tree T. Set it to 0. 
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Main Loop Repeat the following two phases in order nntil oq is matched by M. 


(I) Building phase 

(a) Set (X^_|_i, y£_|_i) •(— (0,0). 

(b) (X^+i,y£+i) ^ BuildLayer(r,X£+i, Y^+i). 

(c) Add the new layer := (A^+i, Y^+i) to T. 

(d) Increment £ to .^ + 1. 

(II) Collapse phase Recall that e € E is immediately addable if no edges from M are blocking 
it, i.e., / nenB = 0V/GM. 

While X£ contains more than /ulX^j immediately addable edges, perform the following steps: 
For convenience, we call this set of steps in this iteration, the collapse operation of layer L^. 

(a) For each / G snch that there is an immediately addable edge e G Xi for d G An /, 

M ^ M\{/}U{e}, 

(b) Discard layer from T. 

(c) In this step we perform a superposed-build operation on layer = {X(i_i,Y(i_i) in T. 
Note that this layer is modihed in this step iff the condition in Step II(c)ii is satisfied. 

i. (X;_„y/_,) ^BuildLayer(r,A^_i,y^_i). 

ii. If > (1 + /x)|A,_i| then, (A,_i,y,_i) ^ {X[_^,Yl_^) 

(d) 


End While. 

After the initialization, the main loop of the algorithm consists of repeating the bnild and 
collapse phases in order. The state of the algorithm at any moment is described by the alternating 
tree T = (Lq, ..., L^) and the partial matching M maintained by the algorithm, both of which are 
dynamically modihed. It is not difficnlt to verify that the addition of an extra layer in the 

bnild phase and the collapse operations in the collapse phase modify T and M in legal ways so 
that the resnlting objects are consistent with the dehnitions of an alternating tree and a partial 
matching, respectively. We nse these facts withont mention in the rest of the analayis. 

Also note that set of vertices matched by M always remains the same thronghont the execntion 
of the algorithm nntil a collapse operation on layer Li is performed, after which M additionally 
matches oq, and the algorithm terminates. 

4 Analysis 

We call a layer Li = {Xi,Yi) collapsible if more than fi\Xi\ many edges in Xi are immediately 
addable with respect to M. This is precisely the condition of the while loop in the collapse phase 
of the angmenting algorithm from Section 3.2. 
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Proposition 4.1. Suppose that the alternating tree T = (Lq, ... ,Li) and the partial matehing M 
deseribe the state at the beginning of some iteration of the main loop of the augmenting algorithm. 
Then none of the layers Lq, ... ,Li are eollapsible. As a eorollary it follows that \Yi\ > (1 — 
for eaeh i = 1,... ,i. 

Proof. Suppose that the statement is true at the beginning of the current iteration. During the 
build phase a new layer is constructed. If Li^i is not collapsible then the claim follows for 

the beginning of the next iteration since none of the previous layers were modified in the current 
iteration. If turns out to be collapsible, then by the definition of the collapse phase the layers 
Lq,. .. ,Lt are left at the end of the collapse phase for some t < i (note that t > 1 unless ao was 
matched and the algorithm terminates in the current iteration). The state of each of the layers 
Lq, ..., Lt-i is unchanged from the beginning of the current iteration. Layer L* on the other hand 
could have possibly been modified in Step lie of the collapse phase. However, since it remains part 
of the alternating tree after the collapse phase it implies that Lt, subsequent to any modifications, 
is not collapsible. Therefore none of the layers in the alternating tree are collapsible at the end of 
the iteration (unless the algorithm terminates after the current iteration). Since the claim is true 
for the first iteration, the claim follows by induction on the number of iterations of the main loop 
of the augmenting algorithm. 

The corollary follows since Li is a layer, for each i = 1,..., ^, and, by Definition 2.3, Yi contains 
all the blocking edges of edges in Xi and each edge in Yi intersects (at most) one edge of Xj. □ 

Before we state the next proposition some clarification is necessary concerning the description 
of the augmenting algorithm in Section 3. For instance, in the building phase, there could be many 
vertices a G AfYi) that have an addable edge (as defined in the Section 3.1), and even a a given 
vertex could take many addable edges from which one is eventually chosen. The final state of layer 
L£_|_i, at the conclusion of the build phase, depends on the sum total of such choices. The situation 
is similar in the collapse phase as well. In order to properly specify the algorithm and refer to 
the quantities maintained by it without ambiguity, we assume that there is a total ordering on the 
vertices \n A A B and edges in E, and that these orderings are used to choose a unique vertex and 
edge in any event that many are admissible according to the algorithm description in Section 3. 
This allows us, for example, to refer precisely to the layer after performing a build operation, 
or to the layer after performing a superposed-build operation on layer L^, etc. 

Proposition 4.2. Suppose that the alternating tree T = {Lq, ... ,Li) and the partial matehing M 
deseribe the state at the beginning of some iteration of the main loop of the augmenting algorithm. 
Then the superposed-build operation on Li (while ignoring layers Lj+i,..., Lt) 

(Xf Yl) ^ BuildLayer{{LQ,..., Li),Xi, Yi), 
where Li = {Xi,Yi) satisfies \X[\ < (1 + /x)|Xj| for eaeh i = 1, ... ,L 

Proof. Consider some layer Lt for 0 < t < i present in the alternating tree at the beginning of the 
current iteration. At the iteration when layer Lt was built a superposed-build operation could not 
have increased the size of Xt even by one. If no collapse operations of some layer occurred until 
the current iteration then the situation remains identical, because layer Lt+i was not collapsed in 
particular. If however, some layer was collapsed then it must have an index strictly greater than 
t (since otherwise, the algorithm would have discarded layer Lt in that case). As every time layer 
Lt+i is collapsed, and some edges from Yt are removed, the algorithm tries to augment by a 
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fraction when possible (in Step lie of the collapse phase), it follows that the nnmber of edges in Xt 
cannot increase by more than a /r fraction on performing snperposed-bnild operation on L*. □ 

To ensnre that the algorithm does not get stnek we need to show that, for some state (Lq, ..., L^) 
and M reached at the beginning of an iteration of the main loop, the bnild phase creates a new 
layer with at least one edge. We prove the following stronger statement. 


Theorem 4.3. Suppose T = (Lq, ... ,Tf) is the alternating tree at the beginning of some iteration 
of the main loop of the augmenting algorithm and let Li^i be the newly constructed layer in the 
build phase of the iteration. Then, 


Xi+l\ > 


e 

5r^ 




for each i = 0,... ,i. 


The proof of Theorem 4.3 nses Lemma 4.4 and Lemma 4.5, which we prove below. 

Lemma 4.4. Suppose that \Y<(\ < [Sr^/e] at the beginning of some iteration of the main loop. 
Then when is built in the build phase of the iteration, |^£+i| > 1. 

Proof. By Proposition 4.1, \X<(\ < |1<£|/(1 — fi) < Gr'^fe, where we nse that /r < 1/10. By the 
choice of = e^/(10r^) we then have p,\X<£\ < 1. Therefore, by the invariants from Proposition 4.1 
and Proposition 4.2, every edge in has at least one blocking edge in the tree, and no A vertex 
with less than U edges in X<£ has an edge that is disjoint from B{X<£ U Y<i). Next, no A vertex 
in the tree can have U = [l//r] edges in the tree, as in that case the nnmber of blocking edges for 
that vertex wonld be at least U which is greater than [5r^/e] contradicting onr hypothesis. Taking 
S to be the set of all the A vertices in the tree, so that \S\ = |T<£|, these argnments show that the 
nnmber of B vertices in the tree is an npper bonnd on t{Es). 

We now show that the nnmber of B vertices in the tree is at most (2r — — 1). To see 

this, note that every non-root A vertex in the tree is inclnded in a nniqne edge in M (also in the 
tree), which in tnrn intersects some nniqne edge in X<£. Therefore for each non-root A vertex in 
the tree, we have (r — 1) corresponding B vertices from the matching edge in M and an additional 
set of at most (r — 1) — 1 many B vertices from the nniqne edge in the tree that intersects this 
matching edge. Fnrther, this acconnts for all the B vertices in the tree. We may over connt some 
B vertices in edges that were added as addable edges in the alternating tree bnt this is hne since 
we are only aiming for an npper bonnd. So each non-root A vertex can be thronght to contribnte 
at most (r — 1) -|- (r — 2) many B vertices to the tree. 

However, the gnarantee is that t{Es) mnst be larger than (2r — 3 -|- e)(|S'| — 1), which is a 
contradiction. □ 


We can say something stronger than Lemma 4.4 when the nnmber of edges from M in the 
alternating tree becomes H(r^/e). Notice that this condition is satished when the nnmber of layers 
in the alternating tree is H(r^/e). 

Lemma 4.5. If \Y<£\ > [Sr^/e], at the beginning of some iteration then when layer is built 
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Proof. Suppose that after the build phase constructing layer Li^i is complete, |-^£+i| < 
where <5 := e/(5r^). 

Let Si be the set of A vertices from A{Yi-i) that would take an addable edge if we were to 
perform a superposed-build operation on layer Li while ignoring layers Li+i,... ,L£+i. Formally, 
Si := A{X[ \Xi), where 

{Xl Yl) ^ BuildLayer((Lo,..., L,), Yi). 


Now define S algorithmically (in the sense of performing steps in order) as follows: 

• set S to be the set of all A vertices in 1<£, 

• remove all A vertices from S that have U edges in the alternating tree (i.e., appear U times 
in the edges in X<^+i), 

• remove all A vertices in uf^^Si from S. 

The number of A vertices that have U edges in the alternating tree is at most |X<^+i|/C/. The 
number of A vertices in is upper bounded by /i|X<£| using Proposition 4.2. Therefore, 

| 5 | > \Y<e\-\X<,+i\/U-fi\X<e\. 


By our hypothesis towards contradiction |X£_|_i| < 5|1<£|. Also, by Proposition 4.1, \X<(\ < 
\Y<f\/{l — fi). Putting these together. 


|5|> 


1- u + 


1 


1 




1 — fl J U 1 — /.i 


l><£|- 


As t{Es) > (2r — 3 + e)(|5| — 1), we have 


riEs) > (2r - 3 + e) 


1- u + 


1 


1 




1 — fi J u 1 — M 


|X<,| - 1 


Recall that /i = e^/(10r^) < 1/10. So, after upper bounding the inner sum by 

]^ \ T ,, 1 ,, ^ ^2 ^2 


5 + 




< 1 2 A* e ^ 10 

1 — fiJU'l — fi~ U 1 — fi^ lOr^ lOr^ 9 ^ 2r'^ ’ 


1 

77 + 


we have 


t{Es) > (2r - 3 + e) 


<r2 n 


1 - 


2r2 


|L<,| - 1 


( 2 ) 


Next we obtain an upper bound on t{Es). We start by proving the following claim. 

Claim 4.6. |i?(X<£_|_iUy<£_|_i)| +^|X<£|(r— 1)^ is an upper bound on the cardinality of the smallest 
size hitting set for Eg that is also a subset of B, i.e., an upper bound on t{Es). 


Proof. From the definition of S, every vertex a £ S appears in one of the layers Lq, ... ,Li and has 
strictly less than U edges in the tree. Further, since each o G 5 is not part of this means 

that there is no edge in H for the vertex a that is disjoint from the B vertices in the tree and the 
B vertices introduced in the superposed-build operations in each of the layers Li,... ,Lf. We now 
bound the total number of such B vertices, to prove the claim. 
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The number of B vertices present in the alternating tree T = (Lq, ..., is simply |i?(X<£_|_iU 
l<£_l_i)|. Next, we know by Proposition 4.2 that a superposed-build operation on a layer Li for 
1 < i < .^ produces a layer L' such that \X-\ < (1 + /i)|Xj|. Further, the set of B vertices introduced 
in Xl (as part of an addable edge and their associated blocking edges) not already present in layer 
Li (which was counted previously), is at most fj,\Xi\{r — 1)^—each addable edge along with their 
blocking edges contains at most (r — 1)^ many B vertices. □ 

We now bound the total number of B vertices in layers Lq, in the alternating tree. The 

contribution from layer is at most |X£+i|(r — 1)^ since each addable edge and its associated set 
of blocking edges can introduce at most (r — 1)^ many B vertices. Next, every edge in X<£ is either 
immediately addable or not. The B vertices in immediately addable edges from layers Lq, ... ,Li 
is at most fi\X<£\{r — 1) using Proposition 4.1. The B vertices from layers Lq, ... ,Li that are not 
present in immediately addable edges can be upper bounded simply by (|y<£| — l)(2r — 3) using the 
same argument as in Lemma 4.4. 

Therefore, from Claim 4.6 the following upper bound is then obtained for t{Es): 

(|y<,| - l)(2r - 3) + fi\X<e\ir - 1) + |X^+i|(r - 1)^ + ii\X<i\{r - 1)^. 

We now explain the terms in the bound. The first three terms bound the number of B vertices 
in the alternating tree as we saw above. The final term upper bounds contributions from edges 
not present in the alternating tree but those that could be added during the superposed-build 
operations on each of the layers Li,..., Li. Using the known bounds on X<£ < 1<£/(1 — //) (from 
Proposition 4.1) and < 5|y<f| (by hypothesis). 


t{Es) < |y<£| 



3) + 


L 

1 -/r 


(r 


1) -h 5{r - 1)^ -k ^ (r - 1)^ 
1 — ^ 


For the chosen parameters ^ = e^/(10r^) and S = e/(5r^), we get. 


t(-^s) < [2r - 3 -k e/2] |y<^|. 


From (2) and (3), we have a contradiction when. 


(3) 


[2r-3 + e/2] \Y<£\ < (2r - 3 + e) 


1 



r<£l - 1 , 


which is true for \Y<(\ > [Sr^/e]. 


We now complete the proof of Theorem 4.3. 


□ 


Proof of Theorem f.3. First notice that after a layer Lj+i is built, the number of edges in Xi+i is 
non-decreasing until it is collapsed in some future iteration. Also, any collapse operation on a layer 
leads to discarding that layer. Then the claim follows by combining Lemma 4.4 and Lemma 4.5 to 
note that at the moment when layer Tj+i is created, for some 0 < i < I’, the inequality |Aij+i| > 
e/(5r^) • |y<i| holds. This also remains true in future iterations until a collapse operation occurs 
in layer Lj+i, in which case it will no longer be part of the alternating tree maintained by the 
augmenting algorithm. □ 
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We are now in a position to bonnd the nnmber of layers in the alternating tree at any point in 
the execntion of the angmenting algorithm. 

Lemma 4.7. The number of layers in the alternating tree T = (Lq, ..., L^) maintained during the 
execution of the augmenting algorithm is always bounded by Oilogn). 

Proof. Snppose there are d+l layers Lq, ... ,Li at the beginning of some iteration of the main loop 
of the angmenting algorithm. Consider some layer Li for 1 < i < ^. By Proposition 4.1 less than 
/i fraction of Xi are immediately addable, and hence \Yi\ > (1 — ^)\Xi\. Then by Theorem 4.3 we 
have 

for each i = This qnickly yields n = \A\ > \Y<i\ > (1 + (1 — /r)e/(5r^))^|lo|) so that 

n > (1 + 7 )^, where 7 := (1 — //)e/(5r^) > 0. Altogether this implies that the nnmber of layers ^ at 
any moment in the algorithm is bonnded by O(logn). □ 

5 Signature Vectors 

To keep track of the progress made by the angmenting algorithm we design a potential fnnction. 
For a given state of the alternating tree with layers Lq, ... in total we define the signatnre of 
layer Lj (for 1 <i < t) as: 

signatnre of Li := ^-[log;, , (4) 

where b := The potential fnnction associated with the alternating tree at any state is the 

seqnence obtained by concatenating the signatnres of the individnal layers in order, and finally 
appending the symbol 00 at the end. We refer to this potential fnnction as the signatnre vector. In 
total there are 21+1 coordinates in the signatnre vector which we write as s = (si,..., S 21 , 00 ). 

Lemma 5.1. The lexicographic value of the signature vector reduces across each iteration of the 
main loop in the augmenting algorithm unless the algorithm terminates during that iteration. 

Proof. Snppose the alternating tree T = (Lq, ..., L^) and the partial matching M define the state of 
the algorithm at the beginning of the iteration. Let the signatnre of the corresponding alternating 
tree be s = (si,..., S 2 £, 00 ). We consider two cases depending on whether a collapse operation 
occnrred dnring the collapse phase of the cnrrent iteration. 

• No collapse operation occurred. In this case only the bnild phase of the iteration 

modified the state of the algorithm by adding a new layer L^+i. Thns, the new signatnre of 
the alternating tree is s' = {s'^,..., S 2 £+ 2 ’ where = Si for all i <2i and 52 ^+ 2 ) 

defined as in (4) for layer at the beginning of the next iteration. Clearly the lexicographic 
valne of the signatnre of the alternating tree has rednced. 

• At least one collapse operation occurred. This means that dnring the iteration a new 
layer was bnilt, and one or more collapse operations occnrred in the collapse phase. Let 
primed qnantities denote the variables after the end of the collapse phase in the iteration. 
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Suppose that t (< £ + 1) is the index of the earliest layer that was collapsed among all the 
collapse operations in the collapse phase in the iteration. If t = 1 then oq was matched and 
the algorithm terminates. Otherwise t > 1 and by the description of the algorithm, the only 
layers left in the alternating tree after the collapse phase are Lg,..., L[_i where L' is identical 
to Li for alH < f — 1. Thus the new signature after the collapse phase is s' = ..., 

where s'^ = Si for all z < 2t — 4 and, 


( s 2 i - 3 ) ^'21-2) 



(5r^/e)* 




(5r^/e)* 



When layer Lt was collapsed Step lie of the collapse phase could have possibly modihed layer 
Lt-i- Accordingly there are two subcases. 


= |At_i|. Since there was no modiheation to Xt-i we look at how lt_i has 
changed. As we collapsed layer Lt in the alternating tree, there must have been at least 
^J^-\Xt\ immediately addable edges in Lt- These must have caused the removal of at least 
^\Xt\/Lf many matching edges in Lt-i- Further, by Theorem 4.3, |At| > e/{hr‘^)\Yt-i\. 
Together this means that 

\YU\ < (1 - {^i/U) • e/{hr^))\Yt-i\ < (1 - fi^)\Yt.i\. 


By our choice of the base of the logarithm it holds that log^ ~ + 

logfe = -1 + logfo Therefore, s'aj.g = S2t-3 whereas s' 2 t _2 < S 2 t- 2 - 

— > (1 + fJ,)\Xt-i\. In this subcase the fact that (1 + /z) > b implies that the 

lexicographic value of the signature vector has reduced since S 2 t -3 < S 2 t- 3 - 


□ 

To show that the augmenting algorithm terminates in polynomial time we need one more fact. 

Proposition 5.2. The coordinates of the signature vector are non-decreasing in absolute value at 
the beginning of each iteration of the main loop of the augmenting algorithm. 

Proof. Consider some layer Li for 1 < i < £. Clearly the corresponding pair of coordinates 
S 2 i-i,S 2 i in the signature vector are non-decreasing in absolute value since \Yi\ > (1 — 
using Proposition 4.1. Between any two layers, by Theorem 4.3 we have \Xi\ > e/(5r^)|yi_i| and 
so |s 2 i- 2 | < |s 2 i-i|- Thus the coordinates are non-decreasing in absolute value in the signature 
vector. □ 

Lemma 5.3. The number of signature vectors is bounded by a polynomial in n. 

Proof. By Lemma 4.7 we know that the signature vector has at most O(logn) coordinates. By 
Proposition 5.2 the coordinates are also integers that are non-decreasing in absolute value. At this 
point one can obtain a trivial bound of O(logn) on the absolute value of each coordinate of the 
signature vector using the dehnition in (4) and Lemma 4.7. Since the sign pattern of the signature 
vector is always hxed, each signature vector can be thought to describe a unique partition of some 
positive integer of size at most 0(log^ n). Recall that a partition of a positive integer N is a way of 
writing N as the sum of positive integers without regard to order. Since the number of partitions 
of an integer of size t is (asymptotically) at most for some absolute constant c > 1 [HR18], the 
claim then follows. □ 
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As noticed by one of the reviewers, the dependence of Lemma 5.3 on the asymptotics of the 
partition function can be avoided by modifying the signature vector to ensure that its entries are 
strictly increasing in absolute value (instead of simply being non-decreasing as in Lemma 5.2), 
thereby allowing a signature vector to be inferred by specifying a subset of a set of size at most 
O(logn). One way to get this property is by adding/subtracting i to the Lth coordinate of the 
signature vector, consistent with its sign pattern. 

We are now in a position to use the potential function dehned in this section to wrap up the 
proof of our main result. 

Proof of Theorem 1.3. From Lemma 5.1 we have that every iteration of the main loop of the aug¬ 
menting algorithm described in Section 3 reduces the lexicographic value of the signature vector. 
Lemma 5.3 further tells us that the number of such signature vectors is bounded by a polynomial 
in n. Thus, the augmenting algorithm terminates in polynomially many iterations. It can also be 
verihed that each iteration of the augmenting algorithm can be implemented to run in time poly¬ 
nomial in n and m. Finally, running the augmenting algorithm n times, starting with an empty 
partial matching, yields the desired perfect matching in H. □ 

From the proof of Lemma 4.5 we also note that the algorithm in Section 3 can be suitably 
modihed to yield the following slightly stronger version of Theorem 1.3 as a corollary. 

Theorem 5.4. For every fixed choice o/e > 0 and r > 2, there exists an algorithm A!{e, r) that takes 
as input an r-uniform bipartite hypergraph H = (A,B,E), runs in polynomial time, and terminates 
after finding either: 

• a perfect matching in H, or 

• a set 5 C A such that t{Es) < (2r — 3 -|- e)(|S'| — 1). 

6 Conclusion and Open Problems 

In this paper we presented a polynomial time algorithm for hnding perfect matchings in bipartite 
hypergraphs satisfying a slightly stronger version of Haxell’s condition. The algorithm is essentially 
the natural generalization of the well known Hungarian algorithm for hnding perfect matchings 
in graphs with two essential modihcations: i) restricting the degree of vertices in the constructed 
alternating tree, and ii) performing updates on the alternating tree lazily. The two ideas in tandem 
give us a polynomial running time bound on the procedure. 

One subtlety here is that the algorithm performs lazy updates in two places, in Steps Ila and lie, 
in the collapse phase. While the former is crucial for the running time bound, the latter seems to 
be an artifact of the analysis driven by the specihe choice of the signature vector in Section 4. In 
particular, this can likely be avoided by choosing a different signature vector to measure progress. 
Finally, we point out the obvious open problem in this line of work. 

Question 6.1. Does there exist such an algorithm with a polynomial running time dependence on 
at least one of the parameters 1/e and r? 
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